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Abstract: We address the issue of single top production in association with a W boson 
at the Large Hadron Colhder, in particular how to obtain an accurate description in the 
face of the top pair production background given that the two processes interfere with each 
other. We stress the advantages of an MC@NLO description, and find that for cuts used to 
isolate the signal, it makes sense to consider Wt as a well-defined production process in that 
the interference with tt production is small, and the cross-section of the former is above 
the scale variation uncertainty associated with the latter. We also consider the case where 
both Wt and tt production are backgrounds to a third process (Higgs boson production 
followed by decay to aW boson pair), and find in this context that interference issues can 
also be neglected. We discuss the generalization of our results to other situations, aided by 
a comparison between the MC@NLO approach and a calculation of the WWbb final state 
matched to a parton shower. 
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1. Introduction 

Top physics is an active research area, not least because the mass of the top quark is close 
to the scale of electroweak symmetry breaking. Given that one expects theories beyond the 
Standard Model (SM) to explain this symmetry breaking, it follows that the top sector is a 
potentially sensitive probe of new physics effects. Top quark production is also of interest 
within the SM, for precision measurements of masses and couplings, and as a background 
to other processes. 

Single top physics (in which a i or t is produced without its accompanying antiparticle) 
is of particular interest, given that the LO processes are all purely electroweak in nature. 
The corresponding diagrams are shown in figure 1, and there are three distinct produc- 
tion modes. The first two are conventionally referred to as the s- and t-channel modes 
(depending on the nature of the exchanged W boson), and have been recently identified 
(in combination) at the Tevatron [1, 2]. The third mode is that of Wt production, and 
is distinguished by the presence of a W boson accompanying the single top quark in the 
final state. Its cross-section is rather too small to be observed at the Tevatron, but makes 
up about 20% of the total single top cross-section at the LHC, whilst the s-channel mode 
becomes negligible. 

It is desirable to isolate Wt production for a number of reasons. Firstly, it is sensitive 
to new physics effects which modify the Wtb vertex of the Standard Model, but not to 
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Figure 1: The three SM single top production modes, shown at LO: (1) s-channel production; (2) 
f-channel production; (3) Wt production. Double lines represent the top quark. 

effective 4-fermion interactions (which mainly affect the s- and t-channel modes). Thus, it 
is in principle a different test of BSM theories (see e.g. [3] for a model-independent analy- 
sis) . Secondly, it offers complementary information on the Wtb vertex within the Standard 
Model (e.g. the value of the CKM matrix element Vti, in connection with the possibility of 
a fourth generation [4, 5, 6, 7, 8]). Furthermore, Wt production is a background to many 
processes, including both neutral and charged Higgs boson production. In such cases one 
must evaluate the sum of top pair production and Wt production as a background, and it 
is important that this be done consistently. 

The cross-sections for single top production in the s- and f-channel modes have been 
calculated at NLO in QCD in [9, 10, 11, 12], with decay effects studied in [13, 14, 15]. 
Recently, the t-channel mode was calculated at NLO in the four-flavor scheme, in which 
initial state b quarks are generated from gluon splitting [16]. The Wt cross-section was first 
considered in [17], and has also been calculated at NLO in QCD [18, 19]. Furthermore all 
three production modes have been implemented in the MC@NLO software framework for 
combining NLO matrix elements with a parton shower algorithm [20, 21, 22], including spin 
correlations in the top decay products using the method outlined in [23]^. This constitutes 
the state of the art for the description of single top physics^, combining the reduction of 
theoretical systematic uncertainties which result from adopting an NLO description of the 
hard event with the high multiplicity, hadron-level events resulting from the parton shower 
algorithm. The latter can furthermore be interfaced with detector simulations. 

The calculation of the Wt mode at NLO is non-trivial (and its implementation in 
MC@NLO is no exception), as discussed in [22], due to the fact that the Wt production 
process (at NLO) interferes with tt production (at LO), with decay of the t (or t quark 
in the case of Wt production) . It becomes unclear whether it is meaningful to define Wt 
production as a separate signal in its own right, or whether one should instead consider 
combining Wt and tt production, i.e. only consider given final states comprised of W 
bosons (or their decay products) and b quarks. The latter approach has practical problems 

'^For a recent study of spin correlations in single top production, see [24]. 

^The s- and t-channel processes at NLO were very recently interfaced with a parton shower in the 
POWHEG framework [25]. 
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of its own, and the question arises of how to obtain the theoretically most accurate descrip- 
tion of Wt production. In [22] two definitions of the Wt mode were given, such that the 
difference between them measures the interference between Wt and tt production. This 
interference is not guaranteed to be small over all of phase space, but by comparing the 
results obtained from the two codes it is possible to ascertain whether or not it makes sense 
to be considering Wt production as an independent process. This problem is not explicitly 
encountered in previous analyses of the Wt mode by experimental collaborations, which 
use LO Monte Carlo descriptions (based on the five flavor scheme, in which b quarks are 
present in the initial state). 

The aim of this paper is to further investigate these questions, and to investigate var- 
ious strategics of how to theoretically describe the Wt mode. There are two issues to 
consider: the reduction of interference between Wt and tt production (i.e. to what extent 
the former is well-defined), and furthermore whether Wt can be efficiently isolated as a 
signal or reduced as a background. The answer to both of these questions depends on the 
experimental cuts applied. However, they are related issues in the sense that cuts used to 
isolate the Wt signal will also influence the interference between Wt and ti production. 

The paper is organized as follows. In the next section, we recall the interference 
problem between the Wt and tt production processes. In section 3 we consider the isolation 
of Wt production as a signal, and show that for fairly loose cuts the Wt cross-section is 
visible above the scale dependence of the tt background, and that interference between the 
two processes is small. In section 4 we consider the case of Wt prodiiction as a background 
to a third process, that of a Higgs boson decaying to a bb pair, and show that in this case 
interference effects are also small, such that one may consider Wt and tt production as 
distinct background processes. In section 5 we examine another approach for describing Wt 
production, namely that of consistently combining Wt and tt-like diagrams, and consider 
the relative merits with respect to the MC@NLO calculation. We discuss our results in 
section 6 and conclude. 

2. Interference problem 

At NLO in QCD, the Wt mode (shown at LO in figure 1) includes the corrections shown in 
figure 2. Such diagrams can also be thought of as the production of a top quark pair, with 



Figure 2: A subset of diagrams contributing to Wt production at NLO, consisting of top pair 
production, with weak decay of one of the final state top particles. 
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decay of the t (or t quark in the case of single antitop production in association with a W 
boson) . A problem then occurs if the invariant mass of the final state Wb system is close 
to the top mass, in that the propagator for the intermediate top particle becomes large. 
More specifically, the Wt and cross-sections are well-defined at LO, with awt < (^tt- The 
NLO correction to Wt, including the diagrams shown in figure 2, then represents a huge 
correction, effectively undermining the perturbative description of the Wt mode. There 
are two main viewpoints for how to deal with this problem. 

The first, and at first sight the most theoretically rigorous approach, is to conclude that 
Wt production does not exist, and that its status as an independent production process is 
an accident of perturbation theory at leading order. One then considers given final states, 
and sums all possible Feynman diagrams to a given order in as and aEW which lead to 
those final states. In this case the relevant final states are WWb and WWbb, i.e. where b 
may denote (anti-)bottom quarks as appropriate^, and WW denotes W^W~. Disregard- 
ing other backgrounds, the WWb state receives contributions from LO Wt production (as 
depicted in figure 1, following decay of the top), whereas WWbb receives contributions 
from NLO Wt graphs as well as LO tt graphs. However, the use of the terms Wt (or 
tt) production does not really make sense in this viewpoint, as only given final states are 
physically meaningful. Although this approach naturally incorporates interference effects, 
it suffers from severe phenomenological and technical problems in practice. In particular, 
corrections to the WWbb final state arising from NLO QCD contributions to tt production 
(followed by decay of both top particles) have not been computed in the above superposi- 
tion of Wt and ti. However, these corrections are known to be large for tt production (as 
we will see), significantly limiting the accuracy of the description if they are not included. 

There are also practical reasons why separation of Wt and tt production is useful. If 
one is trying to isolate single top production as a signal, one wishes to efficiently obtain 
samples of Monte Carlo events corresponding to this signal. If one only has a tool for 
generating the combination of single and top pair production, most of the generated events 
will fail the signal cuts, such that event generation efficiency for the WWb final state is low. 

These problems motivate a second viewpoint, namely that one is allowed to consider 
Wt as a well-defined process, subject to adequate cuts. This relies upon the observation 
that when cuts are applied to isolate the WWb final state, interference effects may be 
small in practice. Thus one may consider them, for practical purposes, as arising from tt 
production with no subsequent interference between single top and top pair production. 
To be more specific, let us split the full NLO corrections to the LO Wt amplitude into two 
parts as follows: 

Awt = Ai + A2, (2.1) 

^This assumes a calculational framework in which initial state b quarks are present (i.e. a five flavor 

number scheme for the parton densities). The discussion is modified in a four fiavor scheme, in which ah b 
quarks are generated explicitly from gluon splittings, as we will see later in the paper. 
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where the first term on the right-hand side contains diagrams with only one top quark 
(either real or virtual), and the second term corresponds to the diagrams in figure 2 con- 
taining two top particles in an intermediate state. The squared amplitude is then given 

by 

\Awtf oc + 2Re[4v42] + |^2p. (2.2) 

One can choose to interpret the first term to be a part of the Wt production process (which 
has a well-defined NLO QCD correction), and the third term to be due to LO production. 
This interpretation is only meaningful provided the interference term 2Re[^|^2] is small, 
and whether or not this is the case depends strongly on the cuts applied. We will see later 
in the paper that cuts that are typically used to isolate the Wt mode at LO do reduce the 
interference term occurring at NLO, and thus the notion of a Wt production process with 
a well-defined NLO correction does indeed make sense. If such cuts are used, the process 
of consistently considering the Wt signal plus tt background then amoTints to generating 
separately samples of Wt and tt events (at LO or NLO as desired) and adding together 
the results. Similar considerations apply if Wt and tt production are both backgrounds to 
a third production process, provided the isolation cuts associated with the third process 
are such as to render the interference between Wt and tt small. The advantages of such 
an approach are obvious: 

• One can efficiently generate both Wt and tt events up to NLO for use in an analysis, 
of particular advantage when Wt is the signal. 

• NLO corrections can be included in both processes i.e. one has separate iC-factors 
for each, greatly increasing the theoretical accuracy of the description. 

• Previous analyses of Wt production at LO can also be consistently performed at NLO, 
provided (as is indeed usually the case) that the LO cuts reduce the interference term 
with tt production. 

The idea of Wt production as a well-defined process at NLO is not new. Indeed, every 
previous calculation of Wt production beyond LO (including those analyses which only 
include tree level diagrams) has had to define some prescription for dealing with the in- 
terference problem [18, 19]. These approaches were compared in detail in [22], and we do 
not repeat the discussion here. Also in [22], two definitions of Wt production were given 
in the context of a full parton shower approach at NLO. These definitions were called di- 
agram removal (DR) and diagram subtraction (DS), where the former removes resonant tt 
effects from Wt at the amplitude level (by not including the diagrams of figure 2), and the 
latter at the cross-section level. The difference then in essence measures the interference 
between tt and Wt production^. Furthermore, both of these definitions are implemented 
in the MC@NLO event generator (see [26] for technical information). By running the same 
analysis with both the DR and DS codes, one is able to check whether interference effects 
are a problem for a given set of analysis cuts, or not. 



''The reader may worry about violation of gauge invariance. This is discussed at length in [22]. 
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If indeed the interference has been shown to be small, then one has succeeded in sepa- 
rating the signal plus background of WWb and WWbb final states into two non-overlapping 
parts, which we may call VFt-like and tt-like signatures. This separation of the final states 
is, as stated clearly above, dependent on cuts. Where such cuts are used, however, the 
Wt and tt separation is a very good (and, importantly, quantifiable) approximation to the 
underlying physics. 

Successful isolation of the Wt mode requires not only that the interference with tt 
is reduced, but also that a good signal to background ratio can be obtained. For exam- 
ple, it has not yet been shown whether the size of the Wt cross-section is such that it 
can be significantly observed relative to the systematic uncertainty associated with the tt 
background. This is the subject of the following section. 

3. Isolating the Wt signal 

In this section we investigate whether it is meaningful to describe a signal of Wt produc- 
tion above a background of tt production. We require two criteria to be satisfied. Firstly, 
that the interference between Wt and tt production can be neglected, as can be checked 
by comparing results obtained with DR and DS. Secondly, that the Wt cross-section is 
larger than the scale variation associated with the tt result. The latter is an indication of 
whether the identification of Wt is meaningful given the systematic errors associated with 
the (potentially large) background, and will not be satisfied for generic cuts. 

Given that we are only considering interference aspects of Wt production in this paper, 
we neglect all backgrounds apart from top pair production. In more realistic analyses, 
further cuts should be applied, but one does not expect these to weaken any separation 
of Wt and tt that has been achieved with looser cuts. Motivated by previous studies 
(e.g. [27]), we consider the following cuts: 

Wt signal cuts 

1. The presence of exactly 1 b jet with pT > 50 GeV and \r]\ < 2.5. No other b jets with 
PT > 25 GeV and \ri\ < 2.5. 

2. The presence of exactly 2 light flavor jets with px > 25 GeV and |?7| < 2.5. In 
addition, their invariant mass should satisfy 55 GeV< rrijj^j^ < 85 GeV. 

3. The presence of exactly 1 isolated lepton with pT > 25 GeV and \r)\ < 2.5. The 
lepton should satisfy AR > 0.4 with respect to the two light jets and the b jet, where 
R is the distance in the (r), (j)) plane. 

4. The missing transverse energy should satisfy EJp^^^ >25 GeV. 

These cuts are designed to isolate scmileptonic decays of the two W bosons, one of which 
comes from the decay of the top quark in Wt. These are cleaner than fully hadronic decays 
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(due to backgrounds), but with a cross-section sizeable enough so that studies are pos- 
sible with early LHC data. Preference for the semi-leptonic decay mode comes from the 
presence of the isolated lepton, and the missing transverse energy requirement (stemming 
from the presence of a neutrino in the final state). Moreover, one expects most I^t-like 
events to have only one hard b jet whereas tt events have two b jets at LO parton level. 
Hence, the requirement of exactly one hard b jet in the final state significantly reduces the 
tt background, and also (as we shall see) the interference between Wt and tt production. 
The latter is not surprising, as it has already been shown that a transverse momentum veto 
on the second hardest b jet reduces very efficiently the interference between single top and 
top pair production [19, 22]. A cut on the number of b jets of given pr is clearly closely 
related to the notion of a veto on additional b jets. In practice, there will be a number of 
14^t-like events due to tt production, where one of the b jets in ti is either too soft to be 
detected, or has been misidentified as a light jet. 

In order to model such effects, we apply the above cuts for a number of choices of b 
tagging efficiency and light jet rejection rate rij. That is, b jets are kept with a probabil- 
ity Cb, and otherwise taken to be light jets. Similarly, light jets arc kept with a probability 
1 — 1/rij (using the conventional definition of the rejection rate), and otherwise taken to 
be b jets. Wc assume the same efficiencies for every jet. This may not be the most realistic 
model, but the hope is that considering different values for and rij adequately explores 
the systematic uncertainty due to these effects. The choices can be found in table 1. We 
also show results with ej, = 1 and rij = 10^, i.e. a default Monte Carlo calculation without 
b tagging effects or light jet rejection included. 

The cut on the invariant mass of the light jet pair helps to discriminate both Wt and 
tt production from other backgrounds. However, it also helps reduce tt relative to Wt 
production, as it requires that the invariant mass of the light jet pair lies within a window 
of the W mass i.e. that the two light jets result from the decay of a 1^ boson. Given that 
there are more jets on average in top pair production, the chance that the two jets entering 
the cuts have both arisen from the same W boson is smaller. 

The above cuts are reasonably loose, particularly given that most px and ij cuts arise 
from detector constraints. Extra cuts would in practice be used to tighten the signal to 
background ratio. However, our aim here is merely to show that even for cuts that are not 
particularly strict, a clean separation of Wt and tt production can be found. Additional 
cuts aimed at enhancing the signal should then further reduce the interference. 

The cross-sections that result after application of the above cuts are shown in table 1. 
All results have been obtained using a top mass and width of mt = 170.9 GeV and Tt = 1.4 

GeV respectively. The W mass and width are Mw = 80.42 GeV and Tw = 2.141 GeV. 
We use the MRST2002 NLO parton densities [28]. By default, renormalization and factor- 
ization scales are set to fip = p^ = nit. The cross-sections have been obtained for strictly 
Wt production, and then multiplied by a factor of two to account for t production. The 
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Table 1: Cross-sections, subject to the cuts outlined in the text, for Wt and production, obtained 
using MC@NLO. The single top results are obtained using both diagram removal (DR) and diagram 
subtraction (DS), and correspond to both top or antitop quarks in the final state. Quoted errors 
are due to scale variation by a factor of two. 

uncertainties quoted correspond to varying the common renormalization and factorization 
scale in the range mt/2 < /i < 2mt. From the table, one may note the following: 

• The DR and DS results agree to within around 3% in all cases, which is similar to 
the uncertainty in each result due to scale variation^. Thus, the interference term 
between Wt and tt production indeed appears to be small. 

• The Wt cross-section is larger than the uncertainty on the tt cross-section due to scale 
variation. Thus, the Wt signal is well-defined and visible above the tt background. 

As stressed above, both of these properties are needed before one can sensibly claim to 
be able to isolate Wt production. Also, they are dependent on the cuts applied, and the 
above cuts are a fairly minimal choice such that both of these requirements can be satisfied. 

Although DR and DS agree at the total cross-section level, it is also important to verify 
the agreement in kinematic distributions. This is possible given that both DR and DS are 
defined in a parton shower context at the fully exclusive level i.e. locally in phase space. 
As examples, in figures 3-5 we show the transverse momentum and rapidity distributions 
of the light jets, b jet and isolated lepton entering the cuts defined above. One sees that 
agreement is obtained within statistical uncertainties, in addition to the agreement within 
scale uncertainties noted above. 

One must also consider distributions for various choices of b tagging efficiency and light 
jet rejection rate. Of these, the former has a potentially damaging effect on the ability of 
jet cuts to reduce the Wt-tt interference, as these rely on cutting out events with a second 
hard 6 jet. The transverse momentum and rapidity distributions for the light and b jets 
are shown, for all four non-trivial choices of and rij given in table 1, in figures 6-7. 



^Slightly more scale variation is observed if the factorization and renormalization scales are varied 
independently from each other. We checked that this does not invalidate the fact that the Wt cross-section 
is larger than the scale variation uncertainty of the top pair production result, when //f and jiR are varied 
such that their ratio is never more than 2. 
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(a) (b) 

Figure 3: The transverse momentum (a) and pseudo-rapidity (b) distributions of the hght jets in 
Wt production (subject to the cuts outHned in the text), shown for both diagram removal (DR) 
and diagram subtraction (DS). The 6-tagging efficiency and hght jet rejection rate are given by 
eh = 0.6 and r;j=30 respectively. Uncertainties are statistical, and the vertical axis has arbitrary 
normalization. 




(a) (b) 

Figure 4: The transverse momentum (a) and pseudo-rapidity (b) distributions of the hard b jet 
in Wt production (subject to the cuts outlined in the text), shown for both diagram removal (DR) 
and diagram subtraction (DS). The ^-tagging efficiency and light jet rejection rate are given by 
Cb = 0.6 and rij=30 respectively. Uncertainties are statistical, and the vertical axis has arbitrary 
normalization. 



One sees good agreement between the DR and DS results for all choices of et and 
rij and all distributions. Thus, the above cuts do isolate Wt production in a well-defined 
sense. Note that the ratio of the Wt and tt cross-sections is ~ 1 : 4.7 (before accounting 
for 6-tagging efficiency and light jet rejection). The above, however, is a rough analysis 
designed to address interference issues. Additional observables can be used to further en- 
hance the signal without diminishing the cross-section too much (see e.g. [27]). However, 
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(a) (b) 

Figure 5: The transverse momentum (a) and pseudo-rapidity (b) distributions of the isolated 
lepton in Wt production (subject to the cuts outhned in the text), shown for both diagram removal 
(DR) and diagram subtraction (DS). The 6-tagging efficiency and light jet rejection rate are given 
by = 0.6 and r;j=30 respectively. Uncertainties are statistical, and the vertical axis has arbitrary 
normalization. 




(a) (b) 

Figure 6: The transverse momentum (a) and pseudo-rapidity (b) distributions of the light jets in 
Wt production, shown for various choices of 6-tagging efficiency eb and light jet rejection rate rij 
(normalized to the first choice). Results are shown for both diagram removal (DR) and diagram 
subtraction (DS). 



it is encouraging that even without a highly optimized signal to background ratio, the Wt 
signal is well-defined. 

The effect of b tagging efficiency and light-jet rejection rate can be further appreciated 
by looking at figure 8, which shows the average number of b and light jets per event (satis- 
fying the detector cuts pT > 25 GeV and \r]\ < 2.5, but before the full Wt signal cuts have 
been applied) before and after reshuffling due to non-trivial and r/j. One sees that the 
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(a) (b) 

Figure 7: The transverse momentum (a) and pseudo-rapidity (b) distributions of the b jet in 
Wt production, shown for various choices of 5-tagging efficiency ef, and hght jet rejection rate rij 
(normahzed to the first choice). Results are shown for both diagram removal (DR) and diagram 
subtraction (DS). 



average number of b jets is slightly below one for Wt production, even before reshuffling. 
Given that a hard b jet is required by the signal cuts, this makes the Wt cross-section more 
sensitive to b tagging efficiency than that of top pair production, as can be seen directly in 
table 1. 
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Figure 8: The average number of light and b jets before (left-hand bins) and after (right-hand 
bins) reshuffling due to 6-tagging efficiency and light-jet rejection rate. The Wt results have been 
obtained using diagram removal (DR). 



In figure 9 we show the total number of jets (light plus b jets) passing the detector cuts. 
One clearly sees that top pair production has higher jet multiplicities on average, hence the 
efficacy of the signal cuts in selecting Wt production. Furthermore, there is a non-trivial 
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Figure 9: Distribution in the total number of jets which pass the detector cuts pr > 25 GeV and 
hi < 2.5. 



fraction of events with five or more hard jets. This, combined with the fact that the signal 
cuts require three jets, suggests that a parton shower framework (rather than a fixed order 
matrix element) is indeed more appropriate for describing Wt production, given the limited 
number of partons in presently available fixed order matrix element calculations. There is 
another reason why a parton shower framework is more appropriate, namely that one does 
not necessarily trust a fixed order matrix element description of emitted partons at lower 
transverse momenta, such as those (~ 25 GeV) involved in the jet veto cuts (see section 
5.1 of [22] for a discussion related to this point). 

A comment is in order regarding the use of a sequential cut method in order to isolate 
the Wt signal, when recent experimental analyses rely more heavily on methods based 
on neural networks, boosted decision trees (BDT) and matrix element methods (e.g. the 
recent discovery of single top production at the Tevatron [1,2]). It is very likely that such 
methods will be applied at the LHC in order to isolate Wt production. For example, a 
sample analysis (at LO plus parton shower level) is presented by the ATLAS experiment 
in [27], alongside a traditional sequential cut analysis. It is not always clear how systematic 
uncertainties in Monte Carlo models propagate through such analyses, including in this 
case the uncertainty attached with separating Wt and ti. The safest way to proceed, in 
cases where there is any doubt, is to repeat a given analysis which depends on the use of 
MC@NLO for Wt production using both the DR and DS options. 

4. Wt production as a background to H ^ WW 

In the previous section, we have shown that it is possible to isolate Wt production as a 
signal. However, this is not the only context in which Wt production occurs - one must 
also consider it as a background to other production processes. In such cases (and as sug- 
gested by the results of the previous section), one wishes to use as accurate a description 
of the background as possible, which strongly motivates the use of MC@NLO. However, 
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one must check in such a case that this description is well-defined, namely that DR and 
DS agree for the cuts used to isolate the signal of interest. If this turns out to be true, one 
may reliably estimate the top background to the production process of interest by combin- 
ing samples of tt and Wt events (corresponding to an incoherent sum of the hard processes) . 

In this section, we consider an example of Wt and ti as backgrounds to a third process, 
that of Higgs boson production with subsequent decay to a pair of W bosons. This is of 
topical interest, given that the H — > W~^W~ decay mode dominates for intermediate Higgs 
boson masses 150 GeV < mn < 180 GeV, making this the only viable discovery channel 
in this window. Furthermore, the dominant background is from top pair production (with 
single top processes also significant), thus this is an excellent example to illustrate the use 
of Wt production as a background. Our aim here is not to present a detailed phcnomeno- 
logical study of Higgs boson production (see p. 110 of [27] for an up-to-date experimental 
study), but rather to examine whether MC@NLO can be used to reliably estimate the Wt 
background. 

In order to minimize QCD jet backgrounds, it is common to consider the case where 
both W bosons stemming from the Higgs boson decay leptonically i.e. 

H W+W~ ltiyil2^2, (4.1) 

where li is either an electron or muon, and vi its corresponding neutrino. Then spin 
correlations can be used to efficiently isolate the signal against top-related backgrounds [29] 
(see also [30, 31, 32, 33, 34, 35]). Motivated by [34, 35, 36], we use the following example 
cuts to isolate the Higgs signal: 

Higgs signal cuts 

1. There must be two opposite sign leptons satisfying pT > 25 GeV and \rj\ < 2.5. 

2. The invariant mass of the charged lepton pair should satisfy 12 GeV< mu < 40 GeV. 

3. The azimuthal angle between the leptons (i.e. the angle in the transverse plane) 
should be less than 7r/4. 

4. The lepton with the highest pr should satisfy 30 GeV< pr <55 GeV. 

5. There must be a missing transverse energy of at least 50 GeV. 

6. There must be no jets (i.e. either b or light jets) with px > 25 GeV and \r]\ < 2.5. 

More sophisticated cuts require isolation of the leptons from hadronic activity, as well as 
tuning of the various parameters introduced above. However, as in section 3, we choose a 
reasonably minimal set of cuts associated with the signal of interest. Conclusions reached 
about whether the Wt background can be well-defined will then apply in more realistic 
analyses. 
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Process 


/fb 


h-^WW 


81.8 ±0.4 


tt 


12.25 ± 0.3 


Wt (DR) 


6.91 ± 0.06 


Wt (DS) 


6.89 ± 0.07 



Table 2: Cross-sections obtained using MC@NLO for the H — > W~^W~ signal cuts described in 

the text, where the W bosons can decay to electrons or muons. Note that the Wt results include 
both top or antitop quarks in the final state. Uncertainties correspond to statistical errors only. 

Of the above cuts, the jet veto (i.e. cut number 6) is particularly effective in reducing 
the background from top quark production, either singly or in pairs. One could again 
consider various b tagging efficiencies Cf, and light jet rejection rates rij, but given that 
the jet veto applies to the total number of jets, these will be irrelevant in our analysis. In 
the results that follow we use parton densities, as well as top and W masses and widths, 
as described in section 3. Our default factorization and renormalization scale choices are 
again fj,ji = fip = mt, and we allow electrons or muons in the decay of the W bosons. 

For the above choice of signal cuts, the Higgs signal cross-section is (using MC@NLO 
with a renormalization and factorization scale equal to the Higgs mass) 81.8 fb for a Higgs 
boson mass mn = 165 GeV. This is comparable to the corresponding figure presented 
in [34, 35], although slightly higher due to the requirement in that paper that the leptons be 
isolated from hadronic activity^. After cuts, the backgrounds due to top quark production 
are somewhat smaller than the background from non-resonant W pair production [29] , but 
are still significant. Our results for the top pair and Wt backgrounds are shown in table 2. 

One sees that the Wt background is more than half the size of the top pair background. 
That these are similar in magnitude is not surprising, given the jet veto involved in the 

selection cuts. Importantly, the DR and DS results for Wt production agree well within 
statistical uncertainties (we checked that these are larger in this case than the uncertainty 
that results from varying the common renormalization and factorization scale by a factor of 
two). As in section 3, it is important to check that kinematic distributions also agree well 
when calculated with both DR and DS. Some examples are shown in figures 10-11, namely 
the transverse and absolute pseudo-rapidity distributions of the two final state leptons. 
One sees that the DR and DS results agree closely within statistical uncertainties. 

We have seen so far that when top production occurs as a background to a given 
process (namely Higgs boson production with subsequent decay to W bosons), one is still 
able to define Wt production as a separate background subject to the cuts used to isolate 
the signal. This means that in evaluating the combined background from top production, 

®To obtain the above number one must include spin correlations in the decay of the Higgs boson, 

particularly given the cut on the azimuthal angle between the lopton pair. These are not implemented in 
the latest public release of HERWIG, hence we use the unreleased version referred to in [34, 35]. 
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Figure 10: The transverse momentum (a) and absolute pseudo-rapidity (b) distributions of the 
lepton from the top quark in Wt production subject to the Higgs signal cuts described in the 
text, obtained using DR (black) and DS (blue). Uncertainties (indicated by the vertical bars) are 
statistical, and the vertical axis shows arbitrary units. 
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Figure 11: The transverse momentum (a) and absolute pseudo-rapidity (b) distributions of the 
lepton from the W boson in Wt production subject to the Higgs signal cuts described in the text, 
obtained using DR (black) and DS (blue). Uncertainties are statistical, and the vertical axis shows 
arbitrary units. 



tt production and Wt events can be generated separately, and the results added together 
without having to worry about interference effects. 

Some remarks are in order regarding how many of the above statements can be gener- 
alized to other processes to which top production is a significant background. There are a 
number of possibihties in general: 

• Top pair and Wt production are comparable in cross-section, and a significant frac- 



tion of the signal cross-section, but such that the interference between Wt and tt 
production is smah. This is the case considered above. 

• Top pair and Wt production are comparable in cross-section, and a significant fraction 
of the signal cross-section, such that the interference is not small. We discuss this 
case in more detail below. 

• Top pair and Wt production are comparable in size, and their sum is an insignificant 
fraction of the signal. One does not have to worry about interference in this case, 
given that top pair production itself is not a significant background. 

• Top pair production is a significant background, but Wt production has a much 
lower cross-section. In general in this scenario interference between Wt and ti is 
non-negligible, but owing to the small size of the Wt cross-section is irrelevant. We 
will see an example of this in the following section, when tt itself is considered as the 
signal. 

As is clear from the above categorization, one need only worry if the second situation oc- 
curs. This naturally presents two options. Either one can find an alternative to separating 
Wt and ti production in order to estimate the background, or one can take the difference 
between DR and DS as a measure of systematic uncertainty. If this latter uncertainty is 
large, one concludes that it does not make sense to think of Wt and tt as separate back- 
grounds. However, it seems likely that this latter situation only occurs in a minority of 
cases, given that most of the time one is trying to reduce both Wt and tt production as 
backgrounds. Given the tt cross-section is generically larger than the Wt cross-section, 
any successful reduction of the top pair background will usually render the Wt interference 
insignificant. 

Ultimately, one expects the MC@NLO calculation for the sum of Wt and tt production 
to be a good approximation in many cases. One may worry in cases where top backgrounds 
remain large, and the signal cuts do not decrease the ratio of top pair to single top pro- 
duction. If in doubt, one may run the DR and DS codes separately, and thus quantify 
the systematic uncertainty due to interference effects. Whether or not this uncertainty 
is significant depends on the process, and also on the other systematic uncertainties (e.g. 
scale variation) involved. 

5. Compcirison with WWbb 

In the previous sections, we saw that one can indeed recover Wt as a well-defined process 
at the LHC, when trying to isolate and measure its properties. We also found that this 
was the case when single and top pair production were considered as backgrounds to a 
third process, namely Higgs boson production with subsequent decay of the latter into a 
W boson pair. The analysis in both cases relied upon two things. Firstly, that one has 
a way of quantifying the effect of interference between Wt and tt production (such as the 
DR and DS codes of MC@NLO). The systematic uncertainty due to interference can then 
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be meaningfully compared with other uncertainties in the problem (such as that due to 
scale variation or statistical uncertainty of the DR or DS results), in order to determine 
whether the Wt mode makes sense. Secondly, that this interference can be reduced through 
adequate cuts. 

Nevertheless, Wt production is not strictly well-defined over all of phase space. In 
regions were the invariant mass of possible Wb pairs not coming from the primary top 
approaches the top mass, the difference between DR and DS is potentially large. It can 
thus be objected that it is questionable to try to consider Wt and tt as separate scattering 
processes, and to only consider given final states (which are well-defined). We consider 
such an approach in this section. 

In the calculational framework adopted in previous sections (i.e. in which initial state 
b quarks are present), the final states relevant to the coherent sum of Wt and tt production 
are WWb and WWbb, as discussed in section 1. Our aim is to calculate the top quark con- 
tributions to these final states, and compare the results with the description of the sum of 
the Wt and tt processes obtained in the previous sections. Thus, we do not consider other 
processes which contribute to these final states (such as non-resonant W pair production) . 

In order to obtain reliable predictions, one must combine the WWb and WWbb final 
states, and preferably interface the output to a parton shower. This raises a number of 
technical challenges (for a detailed discussion in a similar context to this paper, see [37]). 
One must avoid the double-counting that results from the presence of initial state b quarks, 
and diagrams in which b quark pairs are produced by gluon splitting (see [38] for a discus- 
sion in the context of Monte Carlo generators). Furthermore, one must apply a matching 
procedure (e.g. CKKW [39] or MLM [40]) owing to the presence of NLO real corrections 
to the LO Wt process (i.e. WWbb corrections to WWb). How to do this using presently 
available tools is not clear, given that in semileptonic decays of the two W bosons, not all 
of the final state partons are of QCD origin. 

In order to circumvent these difficulties, we consider in this section a fixed fiavor scheme 
in which the bottom quark parton density is not present. All initial state b quarks entering 
the hard interaction are then explicitly generated from gluon splitting, as shown (for LO 
Wt production) in figure 12(a)^. In this approach, there is no WWb final state, thus the 
LO contribution to top quark backgrounds comes from the WWbb state (and the Wt-tt 
interference is a leading order effect). This contains two gauge-invariant subsets of diagrams 
containing intermediate top quarks in the narrow- width approximation: (i) singly-resonant 
diagrams containing one intermediate top quark, such as that shown in figure 12(a); (ii) 
doubly-resonant diagrams containing two intermediate top quarks, such as that shown in 
figure 12(b). The former could naively be interpreted as (LO) Wt production, and the 
latter constitute top pair production. However, all interference effects are now included, 

'^A similar calculation was considered in [41], which studied corrections to the narrow width approxima- 
tion. 
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such that the distinction between Wt and tt production is not considered. 




Figure 12: (a) Singly and (b) doubly resonant contributions to the final state, where all b 

quarks are explicitly produced via gluon splitting. 



The resulting calculation for the WWbb final state can be interfaced to a parton shower 
without worrying about double counting issues, due either to b parton densities (since these 
are no longer present) or matrix element matching. Regarding the latter, there is no double 
counting between the shower and the matrix element, because in the four flavor scheme 
there is no lower order tree level matrix element that, when showered, leads to a WWbb 
final state (this is not true in the five flavor scheme, in which WWb can shower to give 
WWbb). There are also no further matching issues, due to the lack of a coUinear singu- 
larity associated with the two final state b quarks. This would be the case even if the b 
quarks were treated as massless, as in the relevant Feynman diagrams there is never a final 
state b quark pair resulting from a gluon splitting. The required tree-level matrix elements 
can be calculated (including full spin correlations in the decay of the top and W bosons) 
using MadGraph [42, 43]. We then interface these with HERWIG [44] i.e. the same parton 
shower that has been used in the MC@NLO results. 

Having constructed a calculation in which Wt and top pair production are both present 
inclusive of all necessary interference effects, we now investigate the properties of this de- 
scription, including its potential accuracy. Our strategy is as follows. We first generate 
pseudo-data for top production with tt-like signal cuts, obtained using MC@NLO by com- 
bining event samples from tt and Wt production. Next, we compare the WWbb description 
to this, and evaluate the JC-factor which is necessary to normalize the results of this ap- 
proach to the pseudo-data. Then we consider VFt-like cuts, and see how the X-factor 
needed to normalize the final state analysis to the MC@NLO data compares with the re- 
sult using tt signal cuts. If it is the same, one may argue that it makes sense to model 
the combination of Wt and tt production using a tree level approach normalized to data. 
If, however, the JC-factor is not the same for VFt-like cuts (or at least similar), this is an 
argument in favor of separating out Wt and tt production as separate production processes 
in their own right, each with a separate iiT-factor. 
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Process 



o-NLo/pb 



Total (DR) 
Total (DS) 



Wt (DR) 
Wt (DS) 



tt 




Table 3: Cross-sections obtained with MC@NLO for Wt and ti production, using the top pair 
production signal cuts of section 5.1. Uncertainties correspond to variation of the common renor- 
malization and factorization scale by a factor of two. 

The above exercise, whilst somewhat academic (since it does not include additional 
backgrounds due to other single top production modes or non-top related standard model 
processes) is a useful playground for investigating systematic uncertainty due to interference 
between Wt and ti production. By comparing the results from both calculations, we will 
be able to discuss and clarify the relative advantages and disadvantages of each approach. 
In the following section, we discuss the generation of the top pseudo-data. 

5.1 Pseudo-data for top pair production w^ith tt selection cuts 

We form a sample of pseudo-data by running MC@NLO for both the Wt and tt production 
channels, and combining the event samples. We include spin correlations in the decays of 
the top quarks (and W bosons), and the Wt results are run using both DR and DS. 
Parameter choices, parton densities etc. are chosen as in previous sections. Motivated 
by [27], the following cuts are applied in order to isolate the top pair production cross- 
section, after requiring semi-leptonic decay of the two W bosons: 



1. There must be one lepton (electron or muon) with pT > 20 GeV. 

2. The missing transverse energy is required to satisfy E™^^^ > 20 GeV. 

3. There must be at least four jets with pT > 20 GeV. 

4. There must be at least three jets with pT > 40 GeV. 

5. Leptons and jets must satisfy the pseudo-rapidity cuts \rf\ < 2.5. 

The cross-sections for Wt production and tt production are collected in table 3, together 
with their total. Note that the cuts used to isolate the tt signal do not reduce the inter- 
ference with Wt production, as evidenced by the fact that the DR and DS cross-sections 
in table 3 differ by around 25%. However, when combining the event samples, the tt com- 
ponent is much larger than the Wt component, so that the systematic uncertainty due to 
interference between Wt and tt has a negligible effect. The two combined cross-sections 
differ by less than 0.9%, which is clearly much less than the systematic uncertainty due to 



tt signal cuts 
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scale variation. Furthermore, the total Wt cross-section is less than the scale-variation of 
the ti cross-section. Thus, it is questionable whether Wt production is a significant back- 
ground at all, let alone whether ambiguities due to interference effects are significant. One 
may further check that the latter effects are small by comparing kinematic distributions 
in the two combined event samples. As examples, the transverse momentum and pseudo- 
rapidity distributions of the final state lepton are shown in figure 13. One sees that indeed 
the difference between the results for the total of top pair and Wt production is well within 
statistical uncertainties, although the pure Wt results differ somewhat in shape as well as 
normalization. 




Pt 



(a) (b) 

Figure 13: The transverse momentuni (a) and pseudo-rapidity (b) distributions of the final state 
lepton arising from combining MC@NLO event samples for Wt and ti production, subject to tt 
signal cuts described in the text. Results are shown for the cases in which the Wt sample is 
obtained using DR (black), and DS (blue). Uncertainties are statistical, and the vertical axis shows 
arbitrary units. Also shown are the pure Wt results, multiplied by a constant factor so as to be 
visible on the same scale. 

In the following subsections, we compare a tree-level (plus parton shower) final state 
analysis to this pseudo-data. Ideally, one should compare both the MC@NLO and the 
tree-level approach to real data. Since these are not available, the analysis here allows 
one, at least to some degree, to compare the relative advantages and disadvantages of each 
approach. We begin by describing in more detail the tree level calculation. 

5.2 Tree level analysis of final states 

In this section we describe the tree level calculation of the WWbb final state. As explained 
in the previous section, when considering all diagrams contributing to this final state, one is 
restricted to a tree-level calculation, as the full NLO amplitudes for production and decay 
of the relevant top quark intermediate states are not known. Given that the aim of this 
paper is to address the issue of interference effects in single and double top production, we 
consider here only those diagrams contributing to the WWbb state that have intermediate 
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top quark resonances (either single or double). 



Our calculation works as follows. Events are simulated using MadGraph^ for the 
process 

pp^W+W-bb, (5.1) 

where p denotes the proton. As explained in the previous section, initial state b quarks 
are not present, so as to avoid double counting and matching issues. To be consistent, 

we use the top quark width as calculated by MadGraph using the masses given above, 
which is found to be Tt = 1.407 GeV. The decay to final state Icptons and partons is also 
present in the MadGraph events, so that spin correlations of decay products are included. 
In both calculations, the W boson width is set to 2.141GeV, and the branching ratio for 
semileptonic decays is 24/81. 

The event output from MadGraph is interfaced to HERWIG, whose parton shower is 
also used in MC@NLO. The result is then a consistent calculation of the WWbb final state, 
with both interference and shower effects included. Using default parameters and scales as 
described previously, the result for the tt cross-section is 

a«er^ = Q^.Otll^ph, (5.2) 

where the superscript tt denotes that top pair production signal cuts are applied, rather 
than that only tt intermediate states arc considered (which is, of course, not meaningful in 
this approach). The quoted uncertainty stems from varying the common renormalization 
and factorization scale by a factor of two, and one sees that this uncertainty is sizeable. 
From this result and the MC@NLO cross-section given in table 3, one may define the 
iC-factor as the ratio of the central values of the cross-sections^, i.e. 

^tt-cuts ^ <w ^ / 1-508 ± 0.012 (DR) 

<e^'' \ 1-494 ± 0.012 (DS) ' ^ ' 

where the numerator is the MC@NLO combined cross-section for the sum of Wt and tt 
cross-section, obtained using tt signal cuts (see section 5.1). Note that the DR and DS 
results are indistinguishable within statistical uncertainties, as expected from the results 
of table 1. 



The lepton transverse momentum and pseudo-rapidity distributions from both the 

Madgraph and MC@NLO calculations are shown in figure 14. One observes some minor 
difference in shape between the tree level and NLO analyses, which suggests that normaliz- 
ing the LO results via a iC-factor is a somewhat limited approximation. The latter can be 

*Note that MadGraph includes a mass for the b quarks (mj — 4.7 GeV) in the hard matrix element, 
which has not been included in the MC@NLO calculation. We do not expect this to alter our conclusions. 

®Note that we use the same parton densities for both the tree level and MC@NLO calculations. This is 
in contrast to some other definitions of the Jf-factor in which LO and NLO partons are used for LO and 
NLO calculations respectively. This does not affect our conclusions. 
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more clearly seen in figure 15 which shows the ratios, bin by bin, of the leptonic transverse 
momentum and pseudo-rapidity distributions obtained in both approaches. 




(a) (b) 

Figure 14: The transverse momentum (a) and pseudo-rapidity (b) distributions of the final state 
lepton arising from combining MC@NLO event samples for Wt and ti production, subject to tt 
signal cuts described in the text (black) . Also shown is the result from the consistent tree level plus 
parton shower approach discussed in the text. Uncertainties are statistical, and the vertical axis 
shows arbitrary units. 




P Tl 
■^T 'lep 

(a) (b) 

Figure 15: The ratio of normalized distributions in transverse momentum (a) and pseudo-rapidity 
(b) of the final state lepton, from the MC@NLO and MadGraph (plus HERWIG) computations, 
for the top pair production signal cuts discussed in the text. Uncertainties are statistical. 

Having normalized the tree level calculation to the MC@NLO pseudo-data using top 
pair production signal cuts, one may then investigate what happens for the VFi-like signal 
cuts of section 3. Given that these depend separately on the number of b jets and the 
number of light jets, the X-factor for these cuts (defined analogously to eq. (5.3)) will 
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^0 




^DR 


^DS 


1 


10^ 


1.349 ± 0.024 


1.345 ± 0.028 


0.6 


30 


1.367 ±0.028 


1.362 ± 0.028 


0.6 


200 


1.308 ± 0.026 


1.302 ± 0.026 


0.4 


300 


1.357 ±0.032 


1.353 ± 0.032 


0.4 


2000 


1.345 ± 0.032 


1.342 ± 0.032 



Table 4: ii'-factors normalizing the tree level WWhh (plus parton shower) calculation to the sum 
of Wt and ti production obtained using MCQNLO, for the Wt signal cuts described in section 3. 
Results are shown for both DR and DS, and for a range of ^-tagging efficiencies e;, and light jet 
rejection rates rij. The quoted uncertainties are statistical. 



potentially depend on the 6-tagging efficiency and the light jet rejection rate rij. Results 
are shown in table 4, where the i^-factors have been obtained as the ratio of cross-sections 
from the MC@NLO and MadGraph (plus HERWIG) computations. The former results de- 
pend upon whether DR or DS is used for the Wt channel (although we have already seen 
in section 3 that this is a minor effect), thus results are presented for both choices. Prom 
the table, one sees that the K-factor does not depend on whether DS or DR is used i.e. the 
results for each choice of {eb,rij) are equal within statistical uncertainties. However, the 
i^T-factor does depend slightly upon the light jet rejection rate rij and b tagging efficiency e^. 

One might indeed expect each calculation (i.e. the MC@NLO approach and the tree 
level plus parton shower analysis) to depend on the 6-tagging efficiency and/or light jet 
rejection rate, due to the fact that the cuts involve separate restrictions on the numbers 
of b and light jets. However, the sensitivity of the iC-factor to rij and eb means that the 
two calculations are not affected in the same way. This is not surprising, given that the 
MC@NLO calculation has initial state b quarks whereas the tree level plus parton shower 
analysis has all b quarks generated from gluon splitting. The hard matrix element in the 
latter calculation has at least two b quarks in it, whereas the former may have only a single 
b quark. This, coupled with the requirement of one hard 6 jet and two light jets in the signal 
cuts, means that the sensitivity of the two calculations to the light jet rejection rate will 
be different. That this is not a large effect can be seen by comparing figure 16 and figure 8, 
which show the average number of b and light jets before and after shuffling in the two ap- 
proaches. The MadGraph plot of figure 16 shows that there is not a substantial difference 
in the number of b or light jets passing the detector cuts between the tree level calculation 
and the five fiavor scheme adopted in MC@NLO. Whether or not one includes initial state 
b quarks is ultimately a matter of choice, in that both schemes are perturbatively consistent. 

More significantly, the iiT-factor for the Wt signal cuts is not the same as for the tt 
production cuts but is notably lower (by 15%). Note that this difference is significant in 
the sense that it is larger than the scale variation uncertainty associated with the total Wt 
plus tt cross-section (~ 10%). That the K-iactor is lower than that for tt signal cuts is not 
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Figure 16: The average number of b and light jets before (two left-most bins) and after (two right- 
most bins) reshuffling due to ^-tagging efficiency and light-jet rejection rate. Results are obtained 
from the MadGraph plus HERWIG calculation, for the Wt signal cuts. 



surprising given that previous NLO calculations of the Wt mode [19, 18] (both of which 
give some procedure for defining the Wt process) also find that the iiT-factor for pure Wt 
production is lower than that for tt. Thus, when signal cuts are used to isolate the Wt 
signal, one expects that the iT-factor which normalizes the sum of Wt and tt is also reduced. 

One may also evaluate a similar JT-factor for the Higgs signal cuts used in section 4. 
This gives some indication of how well the background to — > WW due to top production 
is estimated, and can be calculated similarly to the result for the H^t-like signal cuts. We 
generate events for the process of eq. (5.1), including the leptonic decays of both W bosons, 
so that spin correlations are included (note that this is particularly important for the Higgs 
signal cuts, because they include a restriction on the azimuthal angle between the lepton 
pair). The branching ratio for the leptonic final state is 4/81. Next, the events are interfaced 
with HERWIG as before, and the iT-factor is then found to be 

cuts 

^m+t = Jl^ = 1-98 ±0.07, (5.4) 

"tree 

where the cross-sections on the right-hand side denote the MC@NLO and Madgraph re- 
sults for the top production background, and the quoted uncertainty is statistical. The 
Wt component of the MC@NLO calculation is obtained using diagram removal. Note that 
the result is higher than the corresponding result for the tt cuts, and again is outside the 
scale variation uncertainty associated with the latter. The former property can be partially 
explained from the fact that the signal cuts involve a strong veto on any jets passing the 
detector constraints. Some of the difference in ii'- factor can then be related to the dis- 
tribution of b and light jets passing the detector cuts in the two calculations (and before 
additional cuts have been applied). These are shown in figure 17. The differences between 
the MadGraph (plus HERWIG) and MC@NLO calculations are as expected. In figure 17(a) 
one sees that there are less events with no b jets in the MadGraph calculation, presumably 
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Figure 17: Distributions of the number of (a) b jets; (b) light jets passing the detector cuts, for 
the sum of Wt and tt production with fully leptonic decays of the W bosons. Results are shown 
for perfect b tagging efficiency and light jet rejection rate (i.e. ef, = 1 and rij — 10^). 



due to the fact that a four flavor scheme has been used so that there are always at least 
two b quarks in the final state. However, there are less events with no light jets in the 
MC@NLO calculation, due to the fact that the NLO matrix element creates harder light 
jets on average, which are more likely to pass the detector cuts. These two effects modify 
the i^-factor in opposite directions, but the net result is that the MC@NLO calculation 
has more events with no jets than does the MadGraph calculation - 3.6% rather than 3.2%. 

To summarize, the above results imply that the MC@NLO description of the sum of 
the tt and Wt cross-sections is not related to the tree level plus parton shower analysis 
by a simple rescaling. The question then is which is the optimal description, that gives 
the most accurate comparison to data. The advantage of the tree level analysis is that 
it consistently combines the Wt and tt processes so that any issues regarding the correct 
inclusion of interference effects are no longer present. However, this would seem to be 
the only advantage. The MC@NLO approach on the other hand benefits from the usual 
advantages of combining a NLO matrix element with a parton shower i.e. reduced scale 
uncertainty, and correct treatment of the first NLO emission. The latter contributes to 
shape differences in distributions, which have indeed been observed above^''. Finally, it is 
clearly advantageous, given the differences observed above, to have two separate K-factors 
for what are essentially two different processes. 

6. Discussion 

In this paper we have addressed the issue of Wt production at the LHC, focussing on 
whether or not it makes sense to consider this as a production process in its own right. 

^^There is also a resummation of logarithms ~ 0{ln{mt/mb)) when a b parton density is used. However, 
these are not expected to be important, as found in [16]. 
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A theoretical ambiguity arises due to interference between Wt and tt production, i.e. the 
same Feynman diagrams contribute to each process. In the five flavor scheme in which a 
bottom quark parton density is used, this interference occurs at NLO and beyond in Wt 
(where the relevant diagrams can be interpreted as LO top pair production, with decay of 
the antitop) . Furthermore, in order to test which solutions to this problem are viable in an 
experimental setting, one must interface the hard matrix element with a parton shower al- 
gorithm, necessitating the use of MC@NLO. The problems of implementing Wt production 
were dealt with in [22] , and the resulting software contains two definitions of the Wt mode 
such that the difference between them provides a measure of the systematic uncertainty 
due to interference effects. The aim of this paper has been to extend the results of that 
paper, by further investigating the circumstances in which such a tool can be used in the 
context of a realistic analysis. 

There are two main contexts in which calculation of the Wt mode is necessary. Firstly, 
there is the isolation of Wt production as a signal, which we considered in section 3. We 
applied basic cuts designed to isolate this signal, and obtained results using both the DR 
and DS options in MC@NLO. These were found to give very similar results, agreeing within 
other systematic uncertainties (e.g. scale variation). Importantly, this agreement persisted 
in kinematic distributions and for all choices of 6-tagging efficiency e^. Furthermore, the 
Wt cross-section was found to be larger than the scale variation associated with the top pair 
production cross-section (also evaluated using MC@NLO), a feature which is dependent 
on the choice of signal cuts. Only if the latter property is satisfied is it truly meaningful to 
address the Wt signal, and that this is indeed the case for fairly primitive cuts is encour- 
aging. This is particularly true given the hope that Wt production can be observed with 
early LHC data (see e.g. [27]), in which case one does not want to have to pay too much 
of a penalty in the Wt cross-section in order to strengthen the signal to background ratio 
with respect to top pair production. 

The second main context in which Wt production occurs is when both this and top 
pair production are backgrounds to a third process. We considered such a case in section 4, 
where our example signal was Higgs boson production with subsequent decay to aW boson 
pair. We found that, for the cuts used to isolate this signal, the cross-section for top pair 
production is comparable with that of Wt production (i.e. within a factor ~ 2). Thus, it 
is imperative in such a case that Wt production be taken into account. Furthermore, the 
DR and DS results agreed very well with each other, and certainly well-within scale vari- 
ation uncertainties. The agreement extended to kinematic distributions, and we showed a 
couple of examples. The question then remains of whether one has to worry about interfer- 
ence between Wt and tt production for other possible signals, and we discussed a number 
of possibilities. The most general advice that can be given is that if there is any doubt 
over the validity of separating Wt and tt production, a given analysis can be repeated 
with DR and DS in order to estimate the systematic uncertainty involved. This must then 
be compared with other uncertainties in order to gauge whether or not the analysis is valid. 
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The possibility remains however of not trying to separate Wt and tt production at all, 
and always attempting to include all Feynman diagrams in a consistent calculation of given 
final states. We discussed such an approach in section 5, in which we interfaced a tree level 
calculation of the WWbb final state (in which all initial state b quarks were generated via 
gluon splittings) with the HERWIG parton shower algorithm i.e. the same parton shower 
that is used in MC@NLO. We normalized this calculation to the MC@NLO results for cuts 
used to isolate the top pair production signal. We then evaluated corresponding X-factors 
for Wt signal cuts, and found that the factor needed was different to that obtained for the 
top pair production cuts, indicating that one calculation is not a straightforward rescaling 
of the other. This was further confirmed by the K factor for the Higgs signal cuts, which 
was different again, and large (~ 2). These results are not surprising, given the differ- 
ence between the two approaches, and raise the question of which is the right approach to 
adopt. One could claim of course that the MC@NLO calculation, in neglecting interference 
effects, is flawed. Or, that the estimate of systematic uncertainty provided by the DR and 
DS codes is not a good estimate, however this can be obtained. We believe that such a 
viewpoint is unduly pessimistic, for several reasons. 

Firstly, the fact that the MC@NLO approach neglects interference diagrams (i.e. di- 
agrams with a top pair intermediate state, where the invariant mass of the antitop is far 
off-shell), whilst an approximation, seems to be a very good approximation throughout 
much of the phase space. The evidence is presented, through numerous examples of total 
cross-sections and kinematic distributions, in this paper. Furthermore, any fixed order 
calculation is an approximation to the underlying physics, and one must carefully consider 
of a number of alternatives which gives the best approximation. The tree level approach 
described above, whilst a consistent combination of Feynman diagrams, suffers from a large 
scale uncertainty, as is typical of LO calculations. Given also the fact that the ii'-factors 
for the two sets of cuts also differ outside this uncertainty, it seems natural to concede that 
MG@NLO provides a better approximation of the underlying physics than the tree level 
calculation matched to a parton shower. 

Such a conclusion is fortunate also for practical and technical reasons. It is clearly 

better, if Wt and tt can be separated, to have the possibility to normalize each separately 
to data. This allows greater flexibility in estimating the top quark backgrounds to other 
processes. Furthermore, in searching for single top production it is useful to have a means 
of efficiently generating events which pass H^t-like signal cuts. MC@NLO provides a solu- 
tion to this problem, in that it cleanly separates Wt and tt production as far as running is 
concerned. 

To conclude, we have critically examined whether one can separate Wt and tt produc- 
tion in a number of contexts. It seems perfectly possible to try to isolate Wt production as 
a signal at the LHC, and existing LO analyses can be profitably generalized to NLO using 
MC@NLO. 
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